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ABSTRACT 

Many recent models consider the structure of individual interstellar medium (ISM) clouds as a way 
to explain observations of large parts of galaxies. To compare such models to observations, one must 
understand how to translate between surface densities observed averaging over large (~ kpc) scales 
and surface densities on the scale of individual clouds (~ pc scale), which are treated by models. We 
define a "clumping factor" that captures this translation as the ratio of the mass-weighted surface 
density, which is often the quantity of physical interest, to the area- weighted surface density, which 
is observed. We use high spatial resolution (sub-kpc) maps of CO and H I emission from nearby 
galaxies to measure the clumping factor of both atomic and molecular gas. The molecular and atomic 
ISM exhibit dramatically different degrees of clumping. As a result, the ratio H2/H I measured at ~ 
kpc resolution cannot be trivially interpreted as a cloud-scale ratio of surface densities. H I emission 
appears very smooth, with a clumping factor of only ~ 1.3. Based on the scarce and heterogeneous 
high resolution data available, CO emission is far more clumped with a widely variable clumping 
factor, median ~ 7 for our heterogeneous data. Our measurements do not provide evidence for a 
universal mass- weighted surface density of molecular gas, but also cannot conclusively rule out such a 
scenario. We suggest that a more sophisticated treatment of molecular ISM structure, one informed 
by high spatial resolution CO maps, is needed to link cloud-scale models to kpc-scale observations of 
galaxies. 



1. CLUMPING AND SURFACE DENSITIES IN ISM MAPS 

Observations of atomic and molecular gas now achieve 
spatial resolution of several hundred parsecs to a few 
kiloparsecs in large samples of nearby galaxies (e.g . 
IHelfer et all 12003 I Walter et al.ll200i ILerov et all 120031 
or even small sets of high redshift galaxies (jHodge et al.1 
120121 [Tacconi et al.ll2012l ). Such observations isolate key 
physical conditions such as stellar surface density, metal- 
licity, or the interstellar radiation field. However, with 
a few exceptions, these observations still do not resolve 
individual clouds of atomic and molecular gas, which are 
often considered to be the fundamental units of the in- 
terstellar medium (ISM). 

The interpretation of these observations often utilizes 

predictions from models that treat the surface density 

of gas on the scale of individual clou ds. For example 

the m o dels of IKrumholz et al.l (l2009alfbT). IWolfire et af 

20101 ). iFeldmann et al.l ( 20121 ). and [Narayana n et al 



(2012) all consider the structure of individual photodisso- 
ciation regions or atomic-molecular complexes to explain 
observations on the scale of galaxies. In these models, the 
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surface density of an individual cloud represents a key pa- 
rameter, often because it indicates the degree of shielding 
from the ambient radiation field. Because these models 
focus on cloud structure the mapping between the readily 
observed average, or "area-weighted," surface density at 
~ kpc resolution and the cloud-scale, "mass-weighted," 
surface density represents an essential component of com- 
paring observations and theory. This mapping is often 
referred to as "clumping" and quantified via a "clumping 
factor." 

For the most part the adopted clumping factors rep- 
resent guesses informed by our coarse knowledge of ISM 
structure and giant molecular clouds (GMCs) but not 
directly based on observations. However, this factor can 
also be directly measured from high spatial resolution 
data. In this letter we collect a large set of observa- 
tions to measure the relationship between the surface 
density of the ISM averaged over large (~ kpc) scales 
and the "true" small scale surface density. We consider 
both atomic (H I) and molecular (H 2 , traced by CO) gas 
and discuss the implications of our calculation for the 
comparison to models. 

We cast this discussion in terms of three quantities: the 
mass-weighted average surface density, (S) , the area- 
weighted average surface density, (S) , and a clumping 
factor, c, relating the two. The mass-weighted average 
surface density is 



(s) m = L SxSW 



I A ^ dA 

j A Y,dA f A XdA 



(1) 



where £ is the true gas mass surface density along a line 
of sight and the integral occurs over some area element 
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A. Then the denominator is simply the sum of gas in 
that area and the calculation returns the mass-weighted 

average surface density over the area. That is (E) is 
the column density at which most mass exists. Contrast 
this quantity with what is observed by a telescope for 
which a resolution element has size A, 



.\ L^ dA 



<£> = 



I A dA 



(2) 



That is, the telescope observes the area- weighted average 
surface density within the beam. 

These two quantities, (E) and (E) , will be the same 
for a smooth medium. They differ for a clumpy medium 
with most of the mass in small, high column density re- 
gions spread over large, low column density areas. In this 
case, (E) may be much lower than (E) . We define a 
clumping factor, c, to quantify this distinction as: 



(2> 



M 



(W 



(3) 



c will be high for a clumpy, inhomogeneous medium and 
low for a smooth medium. It will never fall below unity. 
In practice, (E) will be derived at finite resolution, so 
that c could be more rigorously defined as c b a , the clump- 
ing factor calculated at final resolution b with (E) de- 
rived from data with intrinsic resolution a. In this paper, 
b will always be 1 kpc; a will vary from data set to data 
set. 

The clumping factor, c, (E) , and (E) give us a for- 
malism to ask several questions related to the structure 
of the ISM and the interpretation of ~ kpc resolution 
elements: 

• How does the mass-weighted (E) relate to the 

area- weighted, observable (E) for kpc-resolution 
measurements of atomic and molecular gas in 
galaxies? What are typical clumping factors? 

• Is clumping the same for atomic and molecular gas, 
so that the H^-to-H I ratio at large scales may be 
readily interpreted in terms of cloud structure? 

• Can one reliably predict the surface density of indi- 
vidual regions — relevant to PDR and cloud struc- 
ture calculations — from coarse resolution mea- 
surements? 



2. DATA AND CALCULATIONS 

We assemble all readily available high spatial resolu- 
tion (< 500 pc) CO and H I maps of nearby galaxies and 

use these to calculate (E) , (E) , and c. We make use 
of three recent H I s urveys of nearby galax ies: THINGS 
(I Walter et al. 1 12001 . LITT LE THINGS (IHunter et al.l 
12012J), and VLA ANGST (jOtt et al.ll2012D . We sirrT 
plement these with a collection of H I data obtained to 
complement the HERACLES CO survey (presented in 
Leroy et al.ll2012t iSchruba et al.|[20TTI: iSandstrom et al " 



2012 T) and WSR T maps of M33 jDeul fe van der Hulst 
1981 and M31 ([Brinks fc Shane! 11984ft . Whenever pos- 
sible, we use the naturally weighted data. We include 
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Fig. 1. — Mass- weighted surface density at 1-kpc resolution (y- 
axis) , Equation [l] as a function of area- weighted surface density, 
Equation [2] at the same resolution (x-axis) — note that no incli- 
nation corrections are applied, broadening the spread of apparent 
column densities. That is, the true surface density from which most 
emission arises as a function of the column density that would be 
measured at 1 kpc resolution. Blue points show H I data, red points 
show H2 estimates from CO emission. Light points show individ- 
ual lines of sight, dark points show averages for whole data sets. 
Gray lines show fixed clumping factors spaced by a factor of 2. Hi 
shows good tracking between (£) and (E) with very low clump- 
ing factors, seldom above a factor of two. Conversely, CO exhibits 
a high degree of clumping, almost never less than a factor of two 
but often more than a factor of ten and a scattered, non-universal 
relation between (S) M and <£) A . 
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Fig. 2. — Clumping factor (j/-axis), Equation(3] as a function of 
the linear resolution of the data used to derive the mass- weighted 
surface density, (S) , i.e., the native resolution of the data. The 
color scheme follows Figure [l] Again, H2 traced by CO appears 
much more strongly clumped than H I, exhibiting a wide range of 
clumping factors and tending to show a high degree of clumping. 
H I, by contrast, appears remarkably smooth, seldom exceeding 
clumping factors of two even at high spatial resolution. As in 
Figure [T] the difference in structure of the atomic and molecular 
medium and the variable dumpiness of the molecular medium are 
clearly evident. 
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all galaxies from these surveys that have linear resolu- 
tion better than 500 pc and inclination less than 50° (we 
except M31 and M33 from the inclination requirement). 
For the H I calculation we consider only regions inside 
r25 with column densities iV(H) > 10 20 cm~ 2 . We use 
the integrated intensity maps provided by each survey in 
its data release. 

High spatial resolution CO data remains harder to 
come by than high resolution H I data because nearby 
dwarf galaxies tend to be faint in CO emission and the 
sensitivity of mm- wave telescopes has been limited before 
ALMA. This scarcity leads us to assemble a heteroge- 
neous coll ection of high reso lution CO. This includes the 
MAG MA (jWong et al.l20ll and NANTEN (|Fukui et al.l 
[19991) sur veys of the LMC, t he IRAM 30-m survey of CO 
in M31 (jNieten et al.l 120061) . the combine d BIMA and 
FCRAO survey of M33 (iRosolowskvi 120071 ). ALMA sci- 
ence verification data on the Antennae galaxies, and a 
handful of the brightest an d nearest galaxies from BIMA 
SONG ([Heifer et all [20031 NGC 2903, 3627, 5194, and 
6946). We supplement these with two new datasets: high 
resolution CARMA mapping of select fields in M31 (PI: 
A. Schruba, Schruba et al. in prep.) and the Plateau de 
Bure Arcsecond Whirlpool Survey (PAWS, Schinnerer 
et al. submitted, Pety et al. accepted) of M51. Ex- 
cept for the ALMA Antennae data, all of these data sets 
target the CO J = 1 — > line and include (sometimes 
exclusively) short-spacing data; the Antennae data tar- 
get the CO J = 3 ->• 2 and CO J = 2 -> 1 transition^. 
Note that we have multiple data sets on several galaxies 
(M31, the LMC, M51) and treat each data set, rather 
than galaxy, as a separate measurement. For compari- 
son, we also calcul ate c from the comp osite CO survey of 
the Milky Way by iDame et al.l ((20011 ). considering only 
intermediate latitude (30° > \b\ > 5°) gas. We smooth 
their data with a 1.25° kernel to minimize sampling issues 
and integrate only over areas covered by the surveys. 

Calculating Moment Zero Maps: Because of the E 2 
term in Equation [TJ (E) is not robust to the inclusion 
of noise in the calculation. We must therefore mask the 
data before carrying out our calculations. This is mostly 
an issue for the CO data, as the integrated intensity maps 
provided by the H I surveys have sufficient S/N for our 
purposes. For the CO, we create new masks and re-derive 
integrated intensity maps for each data set. Typically we 
estimate the noise from the empty regions of the cube, 
identify a core of high significance emission, often two 
successive channels with S/N > 5 and expand this high 
significance core to include fainter but still significant 
emission. We integrate the masked data cube to pro- 
duce a moment map, which we use in further analysis. 
For the LMC maps from MAGMA and NANTEN and 
the M33 map, the signal to noise at the native resolution 
is too low to yield a high quality masked integrated in- 
tensity map. Therefore we convolve the data to a slightly 
worse resolution before masking and any analysis. 

This exercise produces maps well-suited to derive 
(E) , but the process of masking at the native resolution 

8 For purposes of calculating surface densities, we assume these 
lines to be thermalized. In actuality, they are likely somewhat sub- 
thermal but uncertainty is likely offset by a somewhat lower apo 
in the Antennae. In any case, these conversion factors effectively 
divide out when calculating c. 



does remove the possibility of picking up any contribu- 
tion from a low S/N diffuse component (e.g., Pety et al., 
accepted) . Fundamentally, this is a limitation of the data 
themselves and future, more sensitive surveys capable of 
detecting diffuse emission over individual lines of sight 
will improve this situation and quantify the contribution 
of faint, pervasive CO emission to the total molecular gas 
budgefl 

When sampling the CO emission, we restrict ourselves 
to areas that include significant emission within the 
mask. Roughly, our criterion is that in the map smoothed 
to 1 kpc resolution, the average brightness is such that 
we could have detected that line of sight at the original, 
higher resolution. That is, we consider areas where our 
sensitivity at high resolution is sufficient to detect the 
average brightness. This allows us to avoid "edge" or 
"clipping" effects in which only one small patch of bright 
emission is included in the beam, leading to high (E) 

but low (E) . Because these "edges" are mostly present 
(within ^5) in the CO and not H I maps, including them 
would make our conclusions more extreme. 

Deriving (E) , (E) , and c: We assume that 21- 
cm and CO emission linearly trace the surface density 
of atomic and molecular gas, i.e., E oc J, and cal- 
culate (E) and (E) following Equations [T] and [2j 
We convert from intensity to surface density adopt- 
ing a fixed aco — 4.35 M Q pc -2 (K km s" 1 )" 1 and 
TV (HI) [cm" 2 ] = 1.823 x 10 18 / H i [Kkms" 1 ]. Note 
that these factors divide out when calculating c. 

To calculate (E) , we smooth from the native resolu- 
tion to 1 kpc using a normalized Gaussian kernel. To 
calculate (E) , we calculate E 2 at the native resolu- 
tion, convolve this map to 1 kpc resolution using a nor- 
malized Gaussian kernel, and then divide that map by 
(E) following Equation Q] We record (E) , (E) , and 
the clumping factor, c, for a hexagonally-spaced set of 
Nyquist-sampled (at 1 kpc resolution) points. 

After this exercise, we have w 50, 000 data points from 
46 galaxies for H I and sw 1, 000 data points from 15 data 
sets in 8 galaxies for CO. As these numbers make clear, 
CO data represent the limiting reagent in this calcula- 
tion, though thanks to ALMA their prospect for short- 
term improvement is excellent . 

3. RESULTS 
Figures Q] and [5] and Table Q] report our results. Figure 
[I] shows (E) as a function of (E) for CO (red) and 
H I (blue) data. Figure [5] plots the clumping factor, c, 
as a function of the linear resolution of the original data 
set used to calculate (E) . In both figures, light points 
show individual lines of sight and dark, solid points show 
averages for whole data sets. Error bars on the whole 
galaxy points in Figure [5] show the la range for that 
data set. Table Q] reports the native resolution (after 
convolution to increase the S/N in M33 and the LMC) 
and median clumping factor with ltr range for each CO 

9 This effect matters but does not appear to dominate our re- 
sults. For example, if we add a pervasive CO component to the 
PAWS M51 data with magnitude ~ 0.5 times our sensitivity — 
an aggressive scenario — then c drops from ss 6 to ss 5.1 over the 
region that we consider. Fainter regions, which we avoid, will be 
more affected. 
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TABLE 1 
Clumping Factors for ISM Maps 



Data Set 

(1) 


res. 

[PC] 

(2) 


(3) 


CO Data 


M31 IRAM 30-m 


87 


q 7 +1.6 


M31 CARMA "Brick 9" 


22 


8.43:3 


M31 CARMA "Brick 15" 


21 


6.4iii 


M33 BIMA+FCRAO 


98 


iq+6.9 


LMC NANTEN 


58 


oq + 15 
•M-12 


LMC MAGMA 


15 


,-,+8.0 
Ji -13 


M51 PAWS 


39 


6.0iJ; 7 


NGC 2903 BIMA SONG 


264 


4 2+ 4 - 2 


NGC 3627 BIMA SONG 


295 


r> 7 + 2.2 
z -'-0.9 


M51 BIMA SONG 


200 


3.5±?:S 


NGC 6946 BIMA SONG 


145 


™±l:l 


Antennae CO(2-l) North 


135 


io+6.7 
ic> -7.4 


Antennae CO(2-l) South 
Antennae CO(3-2) North 
Antennae CO(3-2) South 
Local Milky Way (30° > |6| > 5°) 


127 
95 

87 


7 1+6.8 

7 -1+12.4 
'• i -9.6 

9.8i»;| 

^6 


HI ensemble 




1 2fi+ 046 

1 - ZD -0.15 


... 0-250 pc resolution 
... 250-500 pc resolution 




1 Q4+0.32 

1 1Q+0.18 
i ' iy -0.11 



data set. We report results for the ensemble of H I data, 
which Figure [1] and [2] demonstrate to be uniform. 
These figures illustrate three points: 

1. H I and H2 (traced by CO) exhibit different clump- 
ing factors. Both Figure Q] and Figure [2] show 
that essentially all of our CO data is more highly 
clumped than all of our H I data. The median 
clumping factor for a CO data set is c = 7, 
while the median clumping factor for an H I data 
set is c = 1.26. The specific cases of M33 and 
M31 illustrate this point cleanly. The M31 IRAM 
CO map shows clumping factor rs 4; the M33 
BIMA+FCRAO map shows clumping factor rj 13. 
Both the M31 and M33 H I maps show clumping 
factor w 1.3. 

As a direct result, a ratio of H2-to-H I surface densi- 
ties obtained at large scales does not translate triv- 
ially into a ratio of surface densities at small scales. 
The assumption that the large scale surface density 
in galaxies reflects the small scale surface density 
in the same wa y for H 2 and H I underli es the ap- 
plication of the iKrumholz et al.l (|2009a| ) model to 
explain H2-to-H I ratios in galaxies. Though the 
physics of the model appear to ap ply successfully 
to individual c louds or regions (Bola tto et al.|[20Tlb 
iLee et all2012| ) , we suggest that more than a single 
"clumping" factor is necessary to make a rigorous 
comparison of the model to observations of large 
parts of galaxies. 

Our calculation does not invalidate the kpc- 
resolution ratio of Ejk/Ehi as an interesting mea- 
surement. It simply suggests that this be viewed 
as a measure of mass balance among ISM phases 



over a large area and not indicative of small-scale 
ISM structure. 

2. H I is very smooth. This conclusion leaps out 
of both figures. Even with high linear resolu- 
tion, H I column density maps remain smooth and 
only weakly clumped. This can be explained by 
most 21-cm emission originating not from clumped 
bound clouds, but a diffuse medium with a high 
volume filling factor. The H I clumping factor does 
depend weakly on scale, a reasonable functional 



form is c = 392 (Z pc + 100) 



-1.27 



1, where L 



the (FWHM) linear resolution in parsecs. 



3. CO is clumpy with a wide range of c, making it 
hard to predict (E) from (E) . In contrast to 
HI, H2 traced by CO emission appears clumpy 
with a wide range of c. The median among all data 
sets is « 7 with a factor of 2-3 rms scatter among 
measurements. This is also close to the value 
that we estimate for the Solar Neighborhood from 
intermediate latitude gas, but we caution that we 
expect c to change with improving native resolu- 
tion of the data. Because we consider only bright 
regions, this represents a conservative estimate; 
the edges and faint regions that we exclude tend 
to have high c. The molecule-poor systems (M33 
and the LMC) in the sample show the highest 
c, perhaps because they contain more isolated 
clouds and perhaps because their somewhat low 
metallicities lead any diffuse H2 component to emit 
less in CO (below some metallicity the clumping 
of CO emission and Ejj2 will dramatically diverge). 



We also note two less secure points implied by the data 
but requiring more aggressive assumptions about the 
CO-to-H2 conversion factor: 

4. The total (H2+ H I) clumping factor must vary sig- 
nificantly among and within galaxies. We can only 
calculate the clumping factor for the total (H2 + 
H I) gas in M31, M33, and the LMC. In each case 
the median c is low (~ 1.3, ~ 1.3, and ~ 1.5), 
resembling that of the H I. This is not surpris- 
ing because for fixed aco , the H I mass exceeds 
the H2 mass in these galaxies by more than an or- 
der of magnitude, with H2 making up most of the 
gas along only a small fraction of the lines of sight 
at our resolution (M31 is more molecule-rich than 
the other two, but still H I dominated). Generally, 
we expect that across most of the area in dwarf 
galaxies, which tend to be low-metallicity and H I- 
dominated, c will resemble the ~ 1.3 that we mea- 
sure for H I. The outer parts of most spirals also 
tend to be H I dominated and should show similar 
values, while in the molecule-rich central parts of 
actively star-forming galaxies the values will more 
closely resemble the higher c that we find for M51, 
NGC 2903, NGC 3627, and NGC 6946. 

5. CO exhibits a wide range of (E) . In contrast to 
the common assumption that CO emerges from a 
population of fixed surface density clouds, the CO 
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data in Figure Q] span two orders of magnitude 
in (£) . The figure provides no good evidence 

that CO emerges from a fixed (£} at high spa- 
tial resolution. In fact, the highest resolution data 
sets span roughly an order of magnitude (for fixed 

a co ) in (S) M from the LMC (~ 25 M Q pc~ 2 ) to 

M51 (~ 330 M pc~ 2 ). The spread in (E) M may 
arise from sources other than the cloud scale sur- 
face density: variations in inclination and the con- 
version factor, superposition of fixed surface den- 
sity clouds, the convolution of bound clouds with 
a diffuse background, or a lack of spatial resolu- 
tion matched to individual clouds. However, our 
best guess is that the large range in apparent (E) 
visible in Figure Q] in fact reflects a systematic 
dependence of cloud surface density on environ- 
ment. This reinforces the thorough analysis of 
Hughes et al. (accepted), who compare CO maps 
of M51 (PAWS), the LMC (MAGMA), and M33 
and conclusively demonstrate fundamental differ- 
ences among the volume and surface density PDFs 
among and within the three galaxie s (see also the 
sprea d in Milky Way £ discussed by iBolatto et al.l 
l2013h . 

Our calculations shows that the structure of the molec- 
ular and atomic ISM are more complex than has been 
assumed while vetting recent models. We suggest the 
"clumping factor" approach defined in [JT]to quantify this 
structure and aid interpretation of lower resolution ob- 



servations. With ALMA now able to easily obtain high 
resolution, high sensitivity ISM maps we expect such cal- 
culations to be feasible in many systems over the coming 
years. 
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